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Abstract 

Cesaro convergence of spherical averages is proven for measure- 
preserving actions of Markov semigroups and groups. Convergence in 
the mean is established for functions in L p , 1 < p < oo, and pointwise 
convergence for functions in L°°. In particular, for measure-preserving 
actions of word hyperbolic groups (in the sense of Gromov) we obtain 
Cesaro convergence of spherical averages with respect to any symmet- 
ric set of generators. 

1 Introduction 

1.1 Formulation of the main results 

Let T be a finitely generated semigroup. Choice of a finite set of gener- 
ators O endows T with a norm | • |o : for g 6 T the number \g\o is the 
length of the shortest word over the alphabet O representing g. Denote 
So(n) = {g : \g\o = n}. 

Assume that the semigroup T acts on a probability space (X, v) 
by measure-preserving transformations, and for g e T let T g be the corre- 
sponding map. Now take if G L 1 (A, z/) and consider the sequence of its 
spherical averages 

g&S Q {n) 
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(here and everywhere # stands for the cardinality of a finite set; if S (n) = 0, 
then we set s n (ip) = 0). Next, consider the Cesaro averages of the spherical 
averages: 



The main result of this paper establishes mean convergence of the av- 
erages cjy{(p) for ip G L l (X,i/) and pointwise convergence of cn{<£>) 
for ip G L°°(X,u) in the case when T is a Markov semigroup with re- 
spect to the generating set O. 

Recall the definition of Markov semigroups. As before, let T be a semi- 
group with a finite generating set O. For a finite directed graph G with 
the set of arcs S(G), a labelling on G is a map £: £(G) — > O. Let Vq be 
a vertex of G and let V(G,vq) be the set of all finite paths in G starting 
at vq. To each path p — e\ . . . e n G V(G, vq) we assign an element G T 
by the formula 



The semigroup T is called Markov with respect to a finite generating set O 
if there exists a finite directed graph G, a vertex vq of G, and a labelling 
£: £(G) — > O such that the lifted map £: P(G,^o) — > T is a bijection, and, 
furthermore, for a path p G "P(G, t>o) of length n we have |^(p)|o = n. 

For example, a theorem by Gromov [15] states that a word hyper- 
bolic group is Markov with respect to any symmetric set of generators 
(for cocompact groups of isometries of Lobachevsky spaces, the Markov 
property had been established earlier by Cannon [9]; a detailed exposition 
of the proof of Gromov's theorem can be found in the book of Ghys and 
de la Harpe [11]). 

We are now ready to formulate the main result of the paper. 

Theorem 1. Let T be a Markov semigroup with respect to a finite generat- 
ing set O. Assume that T acts by measure-preserving transformations on a 
probability space (X, u). Then for any p, 1 < p < oo, and any ip G L P (X, v) 
the sequence of Cesaro averages of its spherical averages 



converges in L v [X,v) as N — > oo. If, additionally, ip G L°°(X,v), then 
the sequence cn{^) converges v- almost everywhere as N — > oo. 



n=0 



^)=e(ei)...e(e„). 




N-l 
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Corollary 1. Let Y be an infinite word hyperbolic group {in the sense of Gro- 
mov), and let O be a finite symmetric generating set for Y . Assume that 
Y acts by measure-preserving transformations on a probability space (X,u). 
Then for any p, 1 < p < oo, and any (p G L P (X, v) the sequence of Cesaro 
averages of its spherical averages 



converges in L p (X,v) as N — >■ oo. If, additionally, (p G L°°(X,v), then 
the sequence cn{^) converges v- almost everywhere as N — >■ oo. 

Under additional assumption of exponential mixing of the action, 
pointwise Cesaro convergence for spherical averages of functions from L 2 
for measure-preserving actions of word hyperbolic groups was obtained by 
Fujiwara and Nevo [10]. L. Bowen [2] proved convergence of spherical aver- 
ages for actions of word hyperbolic groups on finite spaces. Both Fujiwara 
and Nevo [10] and L. Bowen [2] also proved that in their setting the limit is 
invariant under the action. 

Our result applies to all measure-preserving actions of all finitely- 
generated infinite word hyperbolic groups. Our argument, however, does not 
give any information about the limit. 

Question. In Theorem 1, when is it true that the limit is T-invariant? 
We conjecture that it always is in Corollary 1. 

1.2 History 

First ergodic theorems for measure-preserving actions of arbitrary countable 
groups were obtained by Oseledets in 1965 [21]. Oseledets endows a countable 
group T with a probability distribution /i satisfying /i(g) = g G Y, 

and establishes pointwise convergence of the sequence of operators 



as n — > oo (here fj,^ stands for the k-th convolution of the measure //). 
To prove pointwise convergence Oseledets uses the martingale theorem 
in the space of trajectories of the Markov chain corresponding to the self- 
adjoint Markov operator S[^; the argument of Oseledets is thus a precursor, 
in the self-adjoint case, of Rota's "Alternierende Verfahren" argument [22]. 




N-l 
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For uniform spherical averages corresponding to measure-preserving ac- 
tions of free groups convergence in the mean was established by Y. Guivarc'h 
[16], who used earlier work of Arnold and Krylov [1] on equidistribution of two 
rotations of the sphere. 

In 1986, R.I. Grigorchuk [12] (see also [13], [14]) obtained pointwise con- 
vergence of Cesaro averages of uniform spherical averages of /^-functions 
for measure-preserving actions of free groups. The limit is invariant under 
the action of the group. 

For functions in L 2 , pointwise convergence of uniform spherical averages 
themselves was established in 1994 by Nevo [17], and for functions in L p , 
p > 1, by Nevo and Stein [19]. The limit was proven to be invariant under 
the subgroup of elements of even length. Whether convergence of uniform 
spherical averages holds for functions in L l remains an open problem (re- 
call that, as Ornstein showed [20], powers of a self-adjoint Markov operator 
applied to a function in L 1 need not converge almost surely). 

In [7], pointwise convergence of uniform spherical averages is obtained 
by applying Rota's "Alternierende Verfahren" Theorem to a special Markov 
operator assigned to the action. This approach also yields pointwise conver- 
gence of non-uniform spherical averages corresponding to Markovian weights 
satisfying a symmetry condition [7]. 

Convergence of Cesaro averages on non-uniform spherical averages for ac- 
tions of free groups and free semigroups holds for general Markovian (and, 
in fact, for general stationary) weights [4], [5], [6]. The motivation behind 
considering such Markovian weights is precisely to establish ergodic theorems 
for actions of Markov groups, in particular, of word hyperbolic groups. 

The results of [6], however, can only be applied to groups that are coded 
by admissible words in an irreducible Markov chain; in fact, to prove invari- 
ance of the limit function, even a stronger condition is needed, which is called 
strict irreducibility in [6] and is equivalent to the triviality of the symmetric 
ex-algebra of the corresponding Markov chain with finitely many states. 

For some groups, a Markov coding is known explicitly: for instance, 
for Fuchsian groups such a coding has been constructed by Series [23]. 
The Series coding does in fact have the strict irreducibility property, and 
pointwise convergence of Cesaro averages of uniform spherical averages 
for measure-preserving actions of Fuchsian groups and for functions in L l is 
established in [8], extending the earlier theorem of Fujiwara and Nevo [10] 
for functions in L 2 . 

For general word hyperbolic groups, however, it is not clear whether 
the Markov coding is irreducible. The main result of this paper is that 
convergence of Cesaro averages of spherical averages still holds without the 
irreducibility assumption. 
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2 Paths and operators 

Let us introduce some notation regarding a directed graph from the definition 
of Markov groups. Consider a finite directed graph G (loops and multiple 
edges are permitted). The sets of vertices and edges (arcs) of G are denoted 
as V(G) and £(G) respectively. For an edge e, 1(e) and F(e) are its initial 
(tail) and terminal (head) vertices. Denote 

£(G,u,v) = {ee S(G) | /(e) = u,F(e) = v}. 

Then, let V(G) be the set of finite paths in G, that is, 

V(G) = {l = ei e 2 ...e k | I( ej ) = F( ej _i)}. 

Denote by |/| the length of a path I. 

Let (X,u) be a probability space. Assume that to every arc e G £{G) 
a measure-preserving transformation T e of (X, v) is assigned. In this case 
we say that G is labelled by measure-preserving transformations of (X, v). 

The map e >->■ T e is naturally extended onto V(G) by formula 

T =T T 

The action of 7], I e V(G), induces a standard action on the space LP(X, v): 
Ti(<f) — <f oT|. For any finite subset L C V(G) introduce an operator s(L) 
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on L P (X, v) acting by the formula 



~ leL 



if L ^ 0; we set s(0) = 0. 
In particular, denote 



Kv,n = G V(G) | 1(1) = u, F(l) = v, \l\ = n}, 

that is, L^ v n is the set of all paths from u to v of length n. Define s^ v , 
s(L^ ) and let c^ v N be their Cesaro averages: 



AT-1 

-u,v,N 



" — \ Q G 

' AT / / u,v,n' 



n=0 



Analogously, denote Lg, >n = LUv(G) and define 



JV-1 



G _ = /rG \ G j_ S G 



N 

n=0 



Theorem 2. Le£ G be a finite directed graph labelled by measure-preserving 
transformations of a probability space (X, v). Then for operators c^ vN and 



c ih),*,n defined above, the following statements hold. 

la. For any ip G L P (X, v), p G [1, oo), the sequence {c^ v n(p)}n=i converges 
mL p (X,u). 

lb. For any <p G L°°(X,u) the sequence {c^ vN (<f)}^ =1 converges v-almost 
everywhere. 

2a. For any ip G L P (X, u), p G [1, oo), the sequence {c G * iv(v)}?/=i converges 
inL p (X,v). 

2b. For any y? G L°°(X,v) the sequence {c^? * N (ip)}x =1 converges v-almost 
everywhere. 

Statements 2a-b of Theorem 2 immediately imply Theorem 1. Indeed, if 
we assign the map T^ e ) to an edge e, then 

S n(p) = sf Q ^ n (cp). 

Now we proceed to the proof of Theorem 2. Define a square matrix M(G) 
of order #V(G) with entries being operators on L 1 (X, z/) by the formula 

M(G) tt)W = ^ T e . 

eg£(G,M,-u) 



Denote also M°(G) Ujt) = #E(G,u,v). Note that if 1 is the function that 
equals 1 everywhere, then T e l = 1 for any e G £(G). Define the following 
class of operators. 

Definition 1. A class B + of operators on L l (X, v) is a set of all operators 
A: V-(X, v) ->■ L X (X, i/) such that 

1. there exists A(A)gR such that A(l) — X(A) ■ 1, 

2. if / > (that is, /(x) > for almost all a: G X) then A/ > 0, 

3. A(L p (X, v)) C L p (X, v) for all p G [1, oo], 

4. < \{A)\\j\\ v for any p G [1, oo], / G D>(X, v). 

It is clear that this class is a convex cone, that is, it is closed under linear 
combinations with nonnegative coefficients. Since all T e 's belong to this class, 
the same is true for M(G) U; „, and 

X(M(G) V>V )= X ^= E UM °( G )v 

Then, consider an n-th power of the graph G, that is, a graph G' = G n , 
where V(G') = V(G), £{G') = {I G P(G),|/| = n}, and /(/) = 7(d), 
F(l)=F(e n ) for I = e 1 ...e n e£(G > ). 

By definition, M(G n ) u<v = £, 6L g 7j. It is also clear that (M(G)) n = 
M(G n ), and 

A((M(G) B ) tt ,„) = (M°(G) n ) UtV = #L° >B . 

Now if we define an operation P on the class B + as P(T) = T/A(T) if T 7^ 0, 
P(0) = 0, then we have 



AT-l 

a.?... = P((M(G»)) tt ,„), cg WiJV = - £ P((M(G")) U ,„). 



n=0 

Similarly, 

JV-1 



P £ (M(G")U,A cg^ = l^P £ (M(G"))^ 

Wv(G) / «=0 V^eV(G) 
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3 The Main Lemma 



The proof of statements la-b of Theorem 2 is obtained through a decom- 
position of the graph G into smaller blocks. The basic (non-decomposable) 
situation is the case of a strongly connected graph (that is, a graph such 
that for any its vertices u, v there exists a path from u to v) and in this 
case the theorem is proven in [6]. A step of the procedure starts with a 
decomposition of the set V(G) into two disjoint nonempty sets Vi, V2 with 
no arcs from V2 to V\. Then we apply Theorem 2 to the induced subgraphs 
with these sets of vertices (that is, a graphs Gj, i = 1,2, with V(G;) = Vi 
and £(Gj) = {e G £(Gj) : 1(e), F(e) G Vi}), and use Lemma 1 (see below), 
which is the main technical statement of the paper. The statements 2a-b of 
Theorem 2 are deduced from the statements la-b using the same lemma. 



Definition 2. A sequence {x n }%L , x n > 0, is called regular if there exists 
a number q G N such that for each r = 0, . . . , q — 1 one of the following 
statements holds: 



(a) x 



qk+r 



for all but finite number of k > 0, 
1 for some a > 0, b G N, c > 1. 

Definition 3. A sequence {T n } n , T n G B + , is called pre- convergent if 
1. the sequence {A(T n )} n is regular; 



(b) lim^ 

v ' k^oo ak b c k 



for any <p G L v [X,v) the sequence | — P(T n )((/?) 
in L P (X, i/) as N ->■ 00; 



iV-l 



n=0 



converges 



converges 



3. for any ip G L°°(X,v) the sequence |— P(T n )(</?) 

almost everywhere as — > 00. 

In these terms, Theorem 2 can be reformulated as follows. 

Proposition 1. Under conditions of Theorem 2 the following statements 
hold. 

1. For any induced subgraph G' of the graph G the sequence {(M(G') n ) u ^} n 
is pre- convergent for any u,v G V(G'). 

2. The sequence 

( ]T (M(Gy% thV 

^ev(G) 

is pre- convergent for any t> G V(G). 
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The first statement of Proposition 1 is equivalent to the statements la-b 
of Theorem 2 for all induced subgraphs of G. This is convenient for our 
inductive argument. The basis for the induction is the following theorem. 

Theorem 3 ([6]). If a graph G is strongly connected, then the sequence 
{(M(G) n ) UjV } n is pre- convergent for any u,v e V(G). 

Remark. 1. Regularity of the sequence {A((M(G) n ) u>t ,)} n = {(M°(G) n ) v }„ 
in the case of strongly connected graph follows from the Perron — Frobenius 
theorem. 

2. Convergence of c^ vN in L l (X, v) and almost everywhere (for functions 
in L X {X, v)) is shown in [6] (see Theorems 1, 2; note that strong connectiv- 
ity of G is called irreducibility of A = M°(G) in [6]). L p - convergence for 
functions in L P (X, v) follows immediately. 

The step of the inductive procedure relies on the following lemma. 

Lemma 1. If sequences {F n } and {G n } of operators from the class B + are 
pre- convergent, then the following ones are also pre- convergent: 

1. {H^} n , HP = F n for n > n , G B+; 

2. {Hi 2) = F n+M } n for any M e Z; 

3. {H^ a) = AF n } n , {M 3b) = F n A} n , where A e B+; 

4. {M 4) = F n + G n } n - 

5. {Hn ^ = J2 k+m=n F k G m } n . 

We now derive Proposition 1 from Lemma 1. 

Proof of Proposition 1. 1. The proof of the first statement is by induction 
on the number of vertices in G'. 

(a) Any graph G' with #V(G') = 1 is strongly connected, thus we can 
apply Theorem 3. 

(b) Take any induced subgraph G' with k vertices and suppose that 
the statement holds for any induced subgraph of G with less than k ver- 
tices. Then there are two cases: (1) G' is strongly connected; (2) G' can be 
decomposed as follows: V(G') = V\ U V 2 , Vi,2 7^ , and there are no arcs 
from V2 to V\. 

In the first case we may apply Theorem 3. In the second case consider 
graphs Gi,2 that are induced subgraphs with V(Gj) = V*. Since Gi j2 have 
less that k vertices, the theorem holds for them. 
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Now consider c^' vN - If u, v G V i: i = 1,2, a path from u to v can't 
leave G*, so (M(G') n j^ = (M(Gi) n ) u ,„, hence c^ N = c^ N , and the state- 
ment is reduced to the one for Gj. The case u G V 2 , t> G Vi is even simpler: 
there are no paths from u to i>, so c„ „ N = for all iV. 

The only nontrivial case is u G Vi, t> G V2. Here 

(M(G') n ) U) , = £ (M(G 1 ) fc ) uy M(G') u 'y(M(G 2 ) m )^,, 

fc+m=n— 1 

and the statement follows from Lemma 1. Indeed, by assumption, the se- 
quence {(M(Gi) n ) U)U /}„ is pre-convergent, hence, by item 3b of this lemma, 
the sequence {{M(G\) n ) u , u ,M{G') u ,, v ,} n is. Then, as {{M{G 2 ) n ) v > , v }„ is pre- 
convergent by assumption, item 5 gives us that 

{^ r >y = J2 (M(G 1 ) k ) u , u ,M(G') u ,, v ,(M(G 2 ) m )vi, v } n 

k+m=n 

is also pre-convergent. Now {A™'™{ v ' v } n is pre-convergent by item 2, and, 
finally the sequence 

{(M(GT)«,„= E ^ y } n 
D'eVi,w'ey2 

is pre-convergent by item 4 of Lemma 1 . 

2. The second statement follows immediately from item 4 of Lemma 1. 

□ 

4 Proof of Lemma 1 

The rest of the paper is devoted to the proof of Lemma 1. The proof will 
often use the following proposition. 

Proposition 2. Let A G B + , (p n G L°°(X,v), ||y? n ||oo < C, (p n (x) — >■ (p(x) 
for almost all x G X . Then (Aip n )(x) — > (Aip)(x) for almost all x G X . 

Proof. Clearly, it is sufficient to prove this only for tp — 0. 

Further, decompose ip n as ip n — ip+ — ip~, where ip^ = max(0, ±</? n ), 
\\<Pn Woo < llVnlloo < C- Therefore, if we prove that A</?^ 0, then A</? n = 
A<£>+ — 0. So we can assume that ip n > 0. 

Now, take ip n (x) = sup{(/?fc(x) | > n}. Then ijj n (x) is monotonically 
nonincreasing and tends to zero for almost all x G X. Since < ip n < ip n , 
the same is true for their images: < A<p n < Aip n and therefore, it is 
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sufficient to prove that Aip n JLf^. o. But as Aip n (x) is nonnegative and 
nonincreasing, there is a limit 6{x) = lim^^ Aip n (x), and, by monotone 
convergence theorem, 

PVn-0||l= / #„(l) " 0{X) dv{x) -)■ 

Therefore, A-0„ — >• in z/). But A is a bounded operator in L X (X, v) 

and HV'nlli — > (also due to monotone convergence theorem), so Aip n — > 
in L l (X, v). Thus #(x) = almost everywhere. □ 

Proof of Lemma 1. The plan of the proof is the following. After some prepa- 
rations, we'll prove the first condition in Definition 3 for all sequences {Hn } n 
(here and below the asterisk * denotes one of the symbols 1, 2, 3a, 3b, 4, 5), 
and then we'll prove the second and the third condtions of that Definition 
simultaneously. 

I. First of all, it is sufficient to prove that this lemma holds for the 
sequences {\(F n )} and {\(G n )} satisfying Definition 2 with q — 1 (and that 
in this case the sequence {X(Hn^)} is also regular with q = 1). 

Indeed, in general case we take q to be the least common multiple of qp 
and qc (i- e., g's from Definition 2 for the sequences {F n } n and {G n } n ). 
For * 7^ 5, it is clear that for a given r = 0, . . . , q— 1 the sequence {Hql\_ r } s de- 
pends in the same fashion on one of {F qs+r i} s and {G qs+r "} s with some r', r" . 

Now consider * = 5. Let k = qu + r' , m = qv + r" (u, v > 0, < r', r" < 
q — 1) and decompose the sum 

FkG m 

k+m=qs+r 

into g sums corresponding to all possible pairs (r 1 , r") (there are only q pos- 
sibilities, since r' + r" = r (mod q)): 

tt(5) ST <l r '< r " 

11 qs+r / j i 

r'+r"=r (mod q) 

where 

Or' ,r" _ \ 771 y^Y 

— 2^ *V+r'<J. 

tt,?J>0 

(qu+r')+(qv+r")=qs+r 

that is, the sequence {Sl' ,r "} s is the convolution of the sequences {F qs+r i} s 
and {G 9S+r »} s shifted by r ~ r ^~ r " G {-1,0}. 



qu+r'G qv+r" j 



m,u>0 
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2. Let us prove that the sequences {X(Hn^)} n are regular. For * = 
1, 2, 3a, 3b this is clear from the definitions. Let * = 4. If {X(F n )} or {\(G n )} 
contains only finitely many nonzero elements, this is clear. Otherwise, let 
(ap,bp,CF) and (a<3, &g, c^) be the constants given in Definition 2 for these 
sequences. 

If (1) cg < cf or (2) cg = cf, &g < &f, then 

lim = 0, 



SO 



fe^oo apk bF c' F 
X(Fk + Gfc 



lim 



1. 



fc^-oo a F k bF c k F 

The symmetric cases (1') cp < cg; (2') cf = cq, bp < &g are similar. The 
only remaining case is cp = cq = c, bp = b G = b. Here 

X(F k + G k ) 



lim 



= 1. 



fe^oo (a,F + CLG)k b C k 

Now let * = 5. The case of finitely many nonzeros is again clear, otherwise 
we can assume that cf > cq- There are two cases, cf > cg and cf = cq. 
Suppose that cf > cq- Then 

E X(F k )\(G m ) n 



k+m=n 



a F (n + l) b Fc F 



m=0 



a F (n + l) b Fc F 



Let us prove that this sum tends to Em=o ^(G m )c F m . 
Denote 1 

A(F n ) Q \{G m ) 



Pn 



a F {n + l) b Fc n F ^ F 

and fix e > 0. Note that the series X]m = o A(G m )cp m = J2m=ofin converges 
absolutely, so there is m such that E m >m Pm < £ - Let A be an upper 
bound for all a n , n > 1 (it exists since a n — > 1). Then 



E C n +i +1 ) F - E a 



m=0 



< 



m 



5> 



m=0 



m=0 



< 



/ m \ b F 
\ ~ n+l) 



+ 



+ 



n 

/ , PmO^n—m I J- , -i ; 

, V n + l/ 



6 F 



m=mo + l 



m=mo+l 



1 We write (n + l) fc in the denominator instead of n b to have well-defined ceo- Never- 
theless, a n tends to 1. 
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The last term is less than e, the second one is less than As and, if n is 
sufficiently large, the first term is less than e, hence the whole difference is 
less than (2 + A)e for sufficiently large n. Thus, {X(Hn^} is regular with 

CO 

a H = a F ^ K G m)c~p m , b H = b F , c H = c F . 
Now let cp = cq = c. In this case we have 

a F a G (n + l) b F +b G+ 1 c n 



]_ A X(F k ) A(G ra _ fc ) ( k + 1 \ hF (i _ JLJ) bG 

+ 1 t^ Q a F {,k + 1)^4 a G (n - k + l) b G C n G ~ k \n + l) \ n + V 



n 



(1) 

and denote a k , f3 n - k and 7^ as it is shown here. Let us show that 

Y n 1 n 

lim — — a fc /3n-fc7n,fc — V] 7n,fc = 0. 

n->oo n + 1 n + 1 

k=0 k=0 

Indeed, by Definition 2, the sequences {a k } {Pk} tends to 1, hence there are 
A, B such that a k < A, j3 k < B for all k. Take any e < 1 and find p such 
that \a k — 1| < e, \/3 k — 1] < e for all k > p. Then 



^ n 1 n 

A n = — OLk$n-kln,k — 7 /] 7n,fc = 

n+1 ^-^ n + 1 ^-^ 

k=0 k=0 

^ / p— 1 n— p n \ 

( 5^ + 5^ + ^ J - l)7n,fc- 

\/c=0 fc=p fc=n— p+l/ 



n + 

Since < 7^ < f, any term of the first and the last sums is bounded 
by AB + 1 and any term of the middle sum is bounded by 2e + e 2 < 3e. 
Therefore, we have 

2p(AB + I) + 3^(n + I - 2p) 2(AB + l)p 

A n < — < 3e + — . 

n+1 n+1 

If n is large enough then the last term is less than e, hence A n < As. 
It remains to find the limit 

1 n 
lim — ^y]7n,fc- 

n-s>oo n+1 z — ' 
fc=0 
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We have 

1 = f n + 2 \bF+b G +i / 1 ^/ j \b F , j Nfc G \ 

n + l^ 7n ' fc U + l/ \n + 2^\n + 2J V 72 + 27 /' 

fc=0 x 3=1 ' 

The first multiplier tends to I. The second one equals the Riemann sum 
of the function f(x) = x bp (l — x) b ° over the unit interval with the partition 

hence it tends to Jq 1 /(x) c&e = £>(&f + 1, &g + !)• Thus, in this case {X(Hn^} 
is regular with the constants 

a H = a F a G B(b F + l,b G + I), b H = b F + b G + l, c H = c. 

3. We proceed to the proof of the second and the third conditions in Def- 
inition 3. 

For * = 1, 2 the difference between Cesaro sums satisfies the relations 
1 JV-l , N-i 1 n -l 

E p (^") - If E p (« = ^ E W> - p («). 

rz=0 n=0 rz=0 

JV-l 1 JV-l M-l 

^ E p (^) ~nI1 p (^) — E ( p (^+«) - p (^))' 

n=0 n=0 n=0 

whence it tends to zero even in operator norm in any L P (X, v), p G [1, oo]. 
For * = 3a, 3b the conditions follows from the identities 

n=0 ^ n=0 ' 

^EV< 3b) )=(^EV..)W). 

n=0 ^ n=0 ' 

The only remaining cases are * = 4, 5. Let us show that we can 
make "approximate" normalisations of operators instead of "precise" ones 
(that is, P( • )) in the second and the third conditions in Definition 3. 
Speaking formally, the following holds. 
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Claim 1. Suppose that the sequence {T n } n , T n G B + , satisfies the condition 

lim = 1 

n^oo an c n 

with some a > 0, b G N, and c > 1. Let 



T 



a(n + l) 6 c n ' 
T/ien for any <p G L P (X, v) the sequences 

1 JV-1 1 AT-l 

n=0 n=0 

converge {in LP or a. e.) simultaneously and their limits coincide. 
Proof. If A(T n ) = 7„, 7„ — » 1, then we have 



IC 



A? 



r" ii - 



N-l 

-J2(P(T n )-f n ) 



N 



n=0 



N-l 



iV 



n=0 



JV-1 



In 



N-l 



< 



<^£i|p(^-|i-7»i<^£|i-7ni, 



n=0 



n=0 



and the latter is the Cesaro sum of x n = |1 — 7„|, which tends to zero. Thus 
the difference Cn — C' N tends to zero in operator norm in any L p (X,u), 
p G [1, oo]. □ 

Now let * = 4. If one of the sequences {F n }, {G n } has only finitely many 
nonzero terms, we can use the lemma's statement for {Hn^} n . Otherwise take 
the constants ap,bp,CF, aa,ba,co same as before and introduce operators 
F n , G n , Hn^ in the same way as in Claim 1. 

From the previous section of the proof one can see that Hn' 1 is either 
^fc^™ + ^T^°n (if c F = c G and b F = b G ), or F n + e n G n with e n ->■ 
(if cq < cp, or if c G = cp and b G < bp), or e n F n + G n (in symmetric cases). 
The convergence of Cesaro sums of in the first case is obvious, in the 
two latter cases the term e n F n (or e n G n ) tends to zero in operator norm: 

\\e n F n \\ < e n (\\F n \\/apn bF c F ) -> • 1, 

and so does the sequence of its Cesaro averages. 
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Finally, suppose * = 5. As usual, the proof is clear if {F n } or {G n } 
contains finitely many nonzero terms, otherwise let a Fj G,H, b F> c,H, Cf,g,h be 
the coefficients in the regularity condition respectively for {A(F„)}, {A(G n )}, 
{A(^ 5) )}. Similarly to the case * = 4, we'll prove convergence for the se- 
quence 



n=0 



There are three cases, c F > cq, c f < cq, and cp = cq. Suppose the first 
one. Then c# = Cf, bn = b F , and for any ip G L P (X, u) we have 



N h N kt<N^ K c f } ^ + m+l) 



N-l 



k+m<N 



N ^ a H 

m=0 k<N-m 



G m ((p) \ 



k + 1 



J \ k + m + 1 



(2) 



V 



Let A be chosen in such a way that ||F fc || < A for all k. Then we have 

k + 1 \ b F 



\s m , N (<p)\\ P < N 

1 



1 E 



k<N-m 



of I, a I, A(G m ) 
— Ffe p — — V 9 



a F . X(G m ) 
v ■ A — ' IW W™ ' 1 ^ 

k<N-rn n F 



E 



. < 

k + m + L 

Aof||</?|| p A(G>, 



Since J2m=o X(G m )c F m < oo, we can choose m such that 



KG m )c F m < e 



a H 



m>mo 



a F \\(p\\p 



Then we have 



^ S m> N(<p) 



m>mo 



< As. 



Further, let us find the limit of S mj N((p) as iV — > oo. Denote 

a F G m (ip) 



an c F 



(3) 



(4) 
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then 



N — m 



N \N -m 
1 



w«o 



k<N-m 



N — m 



Mm 



k+i 



k<N-m 



k + m + 1 



b F 



• (5) 



Due to regularity of the sequence {F k }, the first term in parentheses tends 
in LP or a. e. to a function, which will be denoted as F°(ip m ). Note also that 
the equality 

N-l N-l 



fc=0 



fc=0 



defines a linear operator F° £ B + , with A(F°) = 1. 

The second term in parentheses in (5) is the Cesaro average for the se- 
quence 

r / k -\- 1 \ t> F ~ 

m , k = F k (ip m ) 1 - — — 

L V k + m + 1/ J 

which tends to zero in L P (X, v), p £ [1, oo], as fc — >■ oo. Thus S mi N — > F°(ip m ) 
in LP or a. e. In particular, there exists N m such that for any iV > N m we have 



\\S m , N (<p) ~ F°(ip. 



< 



m + 1 



(6) 



Similarly, for ip £ L°°(X,v), for almost all x there exists N m (x) such that 
for any iV > N m (x) we have 



\S miN (ip)(x) - F°(i; m )(x)\ < 



m + 1 



Note also that (3) yields 

E * 



m>tnij 



II II -M^m) 

^ 2^ — IMIp — — < £ > 

V m>m aU °F 



whence, noting that < 1, we obtain 



mo / co 

E/°(«-^E 

m=0 N m=0 



G m (tp) 



< £. 



(6') 



(7) 
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Now we can see that 



(CO 
m=0 



m>mo 
mo 

m=0 

(f;Fv m )-^°(^f; Gm(v?) 

Vrc=0 ^ rn=0 



°F 



and, if iV > max(iVo, . . . , iV mo ), the estimates (4), (6), (7) give us the in- 
equality 



C N (<p) - F c 



of_ \ ^ G m (ip) 

m=0 r 



< (2 + A)e. 



Similarly, if N > max(N (x), . . . , N mo (x)) for up e L°°(X,is) then (4), (6'), 
and (7) imply 



(oo 
m=0 



G m (<f) 



< (2 + A)e. 



The second case cp < cq is treated similarly. Namely, the sum for CV(y?) 
is decomposed into the sums 



n G v m<N-k 



(8) 



The estimate of its norm for k > ko is the same, and the only difference is 
in the proof of convergence of Sk^i^p) as N — > oo: the argument of in (8) 
tends to G°(ip), so Sk,N tends (in LP or a. e.) to 

a G F k (G°(ip)) 



a H 



G 



with Proposition 2 being used in case (p G L°°(X, v). 
Now consider the third case cp = cq = c. Here 



AT-1 



1 



Fk G m 



n=0 



N ^ a H (k + m) b F +b G+ 1 c k c m 

k+m<N v y 



and the lemma follows from Proposition 3 for X n = F n c n , Y n = G n c n . □ 
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Proposition 3. Let X n , Y n G B + be such that 

1. t/ie sequences {X(X n )/(n + l) u } and {X(Y n )/(n + l) v } are bounded, 

2. for any ip G L p (X,u), p G [l,oo), toe sequences I — ^ 7 T 1 \ <t f 

^ n=0 ^ ' J 

r i Y ( ) ^ 

and < — > — — — — > converqe in L P (X, v) as N — >■ oo, 
l^^(n + l)-J 

3. /or any G L°°(X,v) the sequences ij^^^i — "^l" 1 
i— n ^\t, | converge almost everywhere as N — > oo. 

^ n=0 ^ ' > 

Let Z n = J2k+m= n x kY m , w = u + v + 1. Then 

f 1 JV_1 Z Up) \ 

1. /or any y? G U\X,v), p G [l,oo), toe sequence I — ^ " !• 
converges in L P (X, v), 

f 1 7V_1 Z (</?) 1 

2. /or any </? G L°°(X,u) the sequence — ~ -q w J converges 

almost everywhere. 
Proof. 1. Let 



AT-l „ , „ _ N-l 



X°(P) = lim 1 V X " ( ^ } , F V) = lim - y UV \ • (9) 

n=0 v 1 n=0 v 7 

These operators belong to B + . Indeed, the first two conditions are obvious, 
and, to check the remaining two, one can see that 

llvo, Ml 1 ^ \\X(XM\ p ^ r 1 ^ X(X n ) „ „ 
X u (y?) L < hmsup — > — — — < hm sup — > — \\ip L, 

| V 0/ \ / M /V 1 l^n^)| ... 1 ^ A(X n ) 

^ WW < hmsup — > ? — — — < hmsup — > - — — — \\<p 
iv^oo iV ^ (n + 1)«) tv^oo AT ^ (n + 1)« 

and note that the sequence 

f 1 ^ A(X n ) ) 
is bounded by the same bound as the sequence {X(X n )/(n + l)"} n - 
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2. Now introduce 



X* = X n - (n + 1) U X°, Y: = Y n -(n+ l) v Y°. 

These operators are bounded in any L p (X,u), p G [l,oo], and the norms 
||X*|| p /(n+ l) u , \\Y*\\ p /(n + l) v are bounded uniformly on p G [l,oo] and n 
(indeed, these bounds are simply twice the bounds for ||X„|| p /(n + l) u = 
\{X n )/(n + 1)", ||y„||p/(n + 1)" = \{Y n )/{n + 1)"). This is an analogue 
of the first condition of the proposition; one can see that the second and 
the third conditions hold for X*, Y* in place of X n , Y n . 
Furthermore, 

Z n(<f) X k Y m (tp) 



= E 



(n + l) w ^ (k + m + l) w 

k+m=n 

v xgM) ( v (m + iyx; \ 

^ (k + m + l) w I ^ (k + m + l) w ) y 

k+m=n x fc+m=n ' 

_ x o( (k + i)»Yz(<p) \ v (k + iy(m+iy xOY0 

V ^ (k + m + l) w J (k + m + l) w W> ' 

x k+m=n ' ' k+m=n 



(10) 



To prove Proposition 3, it is sufficient to prove (L p - and a. e.-) convergence 
of Cesaro averages for each term in (10). 

3. For the last term in (10) the proof is simple: 

(fc + l)»(m+l)" _ ,n + 2y ( 1 ^/ j y/ j y\ 
^ (k + m + l) w \n + l) \n + 2^\n + 2J V n + 2J J' 

k+m=n y ' x j=l ' 

Here the first multiplier tends to 1 and the second one is the Riemann sum 
of f(x) = x u (l — x) v with a partition of [0, 1] into n + 2 equal intervals, so 
it tends to the Euler integral B{u + 1, v + 1). Therefore, the last term tends 
to B(u + l,v + l)X°Y°(tp) and so do its Cesaro averages. 

4. To prove convergence of the second and the third terms in (10), it is 
sufficient to prove that Cesaro averages of 



^ (k + m + l) w ^ (k + m + l) w 

k+rn=n k+rn=n 



converge to zero in L P (X, v) for any tp G L P (X, u), p G [1, oo), and a. e. for 
any tp G L°°(X,i/). Indeed, for the second term we denote ip = Y°(tp) and 
for the third one we use either boundedness of the operator X° in L P (X, v) 
or Proposition 2. 
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The expressions in (11) transform to another one when we swap X ■<->■ Y, 
u •<->■ v, and k -B- m, so we may deal only with the first of them. 
Denote 

^4£(F^iP ^ = W 

By construction, p n tends to in L p {X,v) for tp G L p (X,u) and almost 
everywhere for tp G L°°(X, is). Further, 

X* n = (n + l) u ((n + l)A n+1 -nA n ), 

thus 

N-1 



C 



N 



N 



(m + l)"X;(^) 



N^ nj ^ (A; + 771 + 1)' 

n=0 fc+m=n v 



1 (m+l)"(fc+l)",„ ,. , , 

iV E (it + „ + !)■ + 

and, rearranging the sum, we have 

c - V — f V ( m + 1 )" fcu _ ^y^ 1 (m + i)"(fc + i) u 

^"^iVV^ (m + £> ^ (m + A; + l) w 

fc=l x m=0 m=u 

Now we'll use the following statement. 

Claim 2. Lei ajv,fc £ ^> Cfe e where E is a normed space. Suppose that 

1. — >■ as A; — >■ oo ; 

2. /or any /ized TV, t/iere are on/?/ finitely many k 's with a^,k ^ 0, 

3. /or any /ixed fc, -)■ as iV oo ; 

4. t/iere is suca C that ^2 k \dN,k\ < C for any N. 

Then J2k a N,kik — > as iV — > oo. 

Proo/ o/ Claim 2. Let ||^|| < i? for any k. Take any e > and choose fc 
in such a way that ||^|| < e for k > ko. Since 

|o!jv,fe| - > as N — > oo, 

fc<fco 

we can choose iV such that for any N > N 

^2\a Njk \ < e. 

k<ko 
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Therefore, for any iV > N we have 

||^«7v,fe^|| < ^|a7v,fc|||6fc|| = 53l a JV.*IH&ll + $^l aAr ' fc l"^ll - eR + Ce ' 

k k k<ko k>ko 

and the claim is established. □ 

We apply Claim 2 to (12) either with £ k = ip k , S = L p (X,u) 
(if up G L p (X,u)) or with £ k = ip k (x), E = R (if up G L°°(X,u)). Ob- 
viously, £fc — > 0, and we need to check conditions on aN,k, where 



k 

a N , k - ^ 



(N — k + N sp\ m 1)v ( k u _ (k+ir \ 



AT' 

m=0 



(13) 

for fc = 1, ...,JV, otherwise ajv,fc — 0. The value in round brackets is 
of the form /(fc) — f(k + 1) for f(x) = x u / (x + m) w ', so we apply the mean 
value theorem to it. 

There are two cases: w > and w = 0. In the first case, 



k u {k+l) u 



(m + k) w (m + k + 1) 1 



I fl , M _ x u m ]\wn- (v + l)x m \ 



< (k + l)"' 1 M m + + l)x m < (k + I)"' 1 

~~ (m + k) w x m + m ~ (m + A;)™ U 

(here x m G [/c, fc + 1]). Thus we have 

k u+1 (N — k + l) v kw (m + l) v (k + l) u_1 

1°*.*^ + ~W ^ i^TW 

m=0 v 7 

1_ A:(A: + l)"- 1 ^ 1 
-iV + iV ^ (m + A;) u+1 



m=0 



The sum Y^T=k3 * s estimated as 



1 f +oc dx 1 1 / h 1 

< f 1 ' 



k u+l L x u+1 k u+1 uk u ~ \ u) k 
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Continue estimation for |ajvfc| : 



1 / w(l + u) Mk+lV- 1 
\a N ,k\ < T7 1 + 



N 



u 



k u 



N\ u \ k ) J ~ N V 

Hence a.N,k — > as TV — >■ oo for any fixed k , and 

5>«l < i + ■ 2" 



w(l + u) _ T _ x 



u 



-)U— 1 



Thus in the case u > all conditions of Claim 2 hold. 
Now let u — 0. Here 



1 



1 



(m + k) w (m + k + iy 



— w\ 



(x m + m) 



w+l 



< 



W 



(m + k) 



and 



N-k-l 



\CXN,k\ 



1 k \A 
< 1 > 



(m + l)» 
AT ' AT ^ W (k + m) v+2 ~ 

, N-k-l i 

kw \ 1 



AT AT 



m=0 



1 kw 2 _ 1 + 2w 
(k + my-N + Jj'k~ ' 



AT 



hence ajv,fc — > as A" — > oo and ^fel^w.fel — 1 + 2u>. 

5. It remains to consider the first term in (10). Denote 



4» = -£ 

11 ' < 



i ^4 X,! 



n ' (fc + 1)"' 
k=o K ' 



-. n—l 



V* 
1 k 



(fc + 1)"' 

k=0 K 



hence 



X* = (n + l) M ((n + l)A n+1 - nA n ), F„* = (n + l) u ((n + - nB n ). 

Therefore, this term equals 



N-l 



c 



N 



XkKM 

N^ ni ^ (k + m+iy 

n=0 k+m=n 



v £ 



- E 



(k + l) u (m + l) v 



x 



(fc + m + lV 

x ((A; + l)A fc+ i - kA k )((m + l)B rn+1 - mB m )(ip). 
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Rearranging the terms we obtain 2 

~ 1 ^ / k u m" 

C N = — } ( - —[k + m < N + 1 

N ^ \ (k + m-l) wl ~ 1 

k,m>l VV ' 

(k + l) u m v ri _ fc^m + l)" 

+ ( ^+mTt)» 1> " 1 * + m ^- if) 

This sum Cn is decomposed as Cn = C$ + C^, where 

£ ( i) 1 / fcW (A; + 1)W _ 

N ~ N ^ \(k + m-l) w (k + m) w 

fc+m<JV 

- ^ + , ,J )kmA k B m (<p), 14a 



fe,m>l 
fc+ m =7V+l 



„ (fc + l)"(m+l) , . „ , , i ,, „. 
E ( t + m Vi). I 1 *) 

fc,m>l V 7 7 

k+m=N-l 

We'll prove that both 0$ and (7^ tend to zero in L P (X, v) for (p G L P (X, u), 
p G [1, oo), or almost everywhere for G L°°(X, z/). 

Let us start with C$ . Denote g(x, y) = x u y v / (x + y — l) w , then the ex- 
pression in round brackets in (14a) equals 

(g(k, m) - g(k + 1, m)) - (g(k, m + 1) - g(k + 1, m + 1)) 

= -9y(k, n) + g'yik + 1, fj.) = g" xy {n, fj,), 

where x G (k, k + 1), /i G (m,m + 1). (We apply the mean value theorem 
first to ^i(y) = g(k,y) — g(k + l,y) and then to /^(a?) = g' y (x,fJ>)-) One can 



2 Here we use Iverson bracket notation: for any statement .4 



1, A is true, 
0, .4 is false. 
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see that 



vw- 



VW-. r —+W(W + 1)- 



(x + / u-l) w+1 (x + /i - l) w + 2 

As x > A; > 1, // > m > 1, we have x, /i < x + /i — 1, so each fraction 3 is not 
more than l/(x + /i — l) 3 , thus 

.... .. -uf + v w + uw + w(w + 1) 

W X '^ (x + / ,-l) 3 

-uw + + -uw + u>(u> + 1) _ 6 Ui „ 
- (k + m- l) 3 ~ (fc + m- l) 3 ' ^ j 

Now we proceed to an estimation of A k B m ((p). 

Claim 3. I. Let M n = sup fe+m=n || A k B m ((p) \\ p for some ip G LP(X ) v), 
p G [l,oo). Then M n ->■ 0. 

2. Lei M n (ar) = sup k+m=n \(A k B m (<p))(x)\ for some up G L°°(X,v). 
Then M n (x) — > for almost all x G X . 

Proof. 1. Let ||^4fe|| p < C for all fc. Denote y2 m = B m (<p). Since ||y? m ||p — >■ 0, 
for a given e > one can choose m such that ||y? m || p < e for all m > m . 
Then HAi-mVmllp < Ce for m > m , so 

M n < max(||A n ^ ||p, || Ai-i^iIIp, . . . , ||^4„_ mo </7 mo || p , Ce). 

Since ||Ai<£ m || p — > as n — > oo for any fixed m, there are N ,...,N mo 
such that || A n - m (<p m ) \\ p < e for n > N m , m = 0, ...,m . Therefore 
if n > N = max(N , . . . , iV mo ), then M n < max(e, Ce). 
2. Now let y? G L°°(X, i/). Since </? m ^> 0, if we denote 

ip r (x) = max\tp m (x)\, 

m>r 

then tp r 0. Note that Vv^) is nonnegative and nonincreasing sequence 
for any x £ X. 

The operators A k need not belong to B + . But if we denote 



4W = ^£ 



! x„(0) 



iV^ (n + 1)"' 

n=0 v 7 



3 We cannot use this estimate when exponent u — 1 (resp., v — 1) is negative, but then 
u (resp., v) equals zero, and the estimate (15) is simply < for this term. 
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then A+ EB+, and A+(0) ^ X°(9) for any 9 E L°°(X, v) (by definition of 
X°, see (9)). It is also clear that A k = A\ - X°. 
Now define the following "exceptional sets" : 

E 1 = {x | X°(i> r )(x) ^0}, 

E 2 m = {x | A fc (v? m )(:r) 0}, 

£ r 3 = {:r| A fc (^ r )(:r)^0}, 

= | A fc (^ m )(x) -+->• 0}, 

Their measure is zero due to Proposition 2 (for E 1 , E%) and since A k {6) ^—^ q 
(for E 2 m , El). Denote E = E l U (lj m ^) U (lJ r £ r 3 ) U (U^) and prove 
that M n (x) ->■ for any x G X \ £. 

Indeed, take any e > 0. Choose r such that X (-?/v )(x) < e (here we 
use that x E 1 ). Note that since X° e B + , X°(ip r ) > for any r and 
X°{i) r ) < X°(4> ro ) < e for any r > r . 

Now choose fc such that \A k (ip ro )[x)\ < e for any k > k (x E%). 
Then all possible fc's are divided into three classes, each class is estimated 
separately. 

Case 1. Let k = 0, . . . , k . Then, since x ^ E%, there exists such that 
\A k (ip n _ k )(x)\ < e for any n > . Choose 7V« = max(iV (1) , . . . , Nj£). 
Then for any n > iV (1) 

M£\x) = max |A fcJ B m (^)(x)|) < e. 

fc<fco 

Case 2. Let /c = fc + 1, . . . , n — r . Then 

\A k (tp n _ k )(x)\ < \A+(ip n _ k )(x)\ + \X°(<p n - k )(x)\ < 

< A+(^ ro )(x) + X°(^ ro )(x) < 2X°(^ ro )(x) + \A k (^ ro )(x)\ <2e + e = 3e. 
Thus, 

M {2 \x) = max \A k B m (<p)(x)\ < 3e. 

kg<k<n— ro 

Case 3. Let = n — ro + 1, . . . , n. Then, since A n ((p m )(x) n ^°°> 
for any m — 0, . . . , ro — 1 (we use that x ^ E^), one can choose JVi 3 ' such 
that |A n _ m ((/? m )(x)| < e for any n > Nm , m — 0, . . . , ro — 1. Thus, for any 
n > N ( V = max(iV (3) , . . . , N^) 

M<£\x) = max \A k B m (if)(x)\) < e. 

k+m=n 
k>n—ro 
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Putting these estimates together, we obtain that 

M n {x) = max(M (1) (x),M (2) (x),M (3) (:r)) < 3e 

for n > N = max(iV( 1 ), N®). □ 
Combining (15) with Claim 3, we have 

||^(i)|i < 1 V- Q u,vkm 

k,m>l v ' 
k+m<N 

<^ E rT^^+™ = %£ E J % = % i E^ 

JSI k + m — 1 W ' n — 1 W 

k,m>l n=2 k,m>l n=2 

k+m<N k+m=n 

and -^ Yln=2 M n N ~*°°> as Cesaro averages of the sequence {M n }, which 
converges to zero. For a. e. -convergence this proof also works after substitu- 
tion of \c£\x)\ for \\C$\\ and of M n (x) for M n . 
Now we estimate Cffl . 

N 



CP = ± ( E {N+1 -^T mV+1 (N + 1 - m)A N+1 _ m B M 

- E (jV - m) ; ( r irm (iv - 1 - m)A JV _ 1 _ mJ B m (^)) = 

m=l ' 

N-l 

1 \ (N+l-m) u m v+1 (N-m) u (m+l) v m~\ , AT -, \ A o / \ 

= n 2^ [ ^ ^ — J - 1 - m)A 7V _i_ mJ B m (^) 

m=l 

1 ^ (N+ l-m) u m v+1 r/Ar i . ^ n . . 

+ n E " ^ [(iV + 1 - m)A iV+1 _ mJ B m (^) 

m=l 

-(N-l- m)A N _ l _ m B m { V )} + 



(16) 



Convergence of the last term is immediate. For the first term we apply 
Claim 3. Indeed, the expression in square brackets is of the form 

m(f(m) - f(m + 1)), 
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and the mean value theorem yields that (here /i G [to, to + 1]) 
(N + 1 - m) u m v+l (N - m) u (to + l) v m 



i j / 

= J^J I ~ U ( N + 1 - V +v(N + l- /x) V -1 1 < 
>(7V + 1 -u) u ~V v(N+ 1 -a) u a v - 1 \ u + v u + v 

< TO — 1 — < TO——— < 



TV™ N w J ~ N 2 ~ N ' 

whence L p -norm of the first term is bounded by 

JV-l 

A? £ " 1 " ™)\\AN-l-mB m {tp)\\ < 

m=l 

1 N ^ 

m=l 

so it tends to zero. The same argument works for a.e.-convergence, with 
L p -norm being replaced by absolute value of value at x and M^-i being 
replaced by Mjy-i(x). 

As for the second term in (16), the coefficient (N + 1 — m) u m" +1 /N w is 
bounded by 1, and the expression in square brackets equals 

(N + l- m)A N+1 _ m B m (^) — (N — 1 — m)A N -i- m B m (<p) = 

^ + ,/T^., WvO)- (17) 



Denote 



Then the sequence 



(N + l-m) u (N + 2-m) u 

Xk Xk+i 
~ (k + l) u (k + 2) u 



N-l 
k=0 



tends to 2X°(y?) = W°((f) in L p (X,u) (for <p G L p (X,u)) or a. e. 
(for <p G L°°(X, i/)), hence (17) is equal to (Wjv_ m - W°)(.B m (p)). 

Claim 4. 1. //y? G L p (X,z/) ; then 

1 " 



N 

m=l 



toncfe to zero. 
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2. If<pe L°°(A», t/ien 



1 N 

Sx Ln = a? EK^-™ - W°){B m { V )){x) 



' VU ' AT 

m=l 



tends to zero almost everywhere. 

The second term in (16) is estimated by Sn (in L p -norm) or by S^{x) 
(pointwise in absolute value). Hence it remains to prove this claim to com- 
plete the proof of Proposition 3. 

Proof of Claim 4- 1- Let C be a constant such that \\W k — W°\\ p < C for 
all k. Then 

N N 

S N < ^ ^2\\W N . m - W% ■ \\B m (tp)\\ p < -J2\\BM\\ P 

m=l m=l 

the latter is the Cesaro average (multiplied by C) of the sequence ||5 n (<^)|| p , 
which tends to zero. 

2. As in Claim 3, denote y? m = B m (<p), ip r {x) = max m < r |</? m (a;)|. Let 
constants C and R be such that \\Wk — W^°||oo < C for all k and ||v?m||oo < R 
for all m. Define the following "exceptional sets" 

E 1 = {x | W°(i> r )(x) ^0}, 

N-1 

E 2 r = {x\^J2^- W °^)(x) k -^0}. 

and let E = E 1 U ([j r E r)- 

Fix any x E X\E and take any e > 0. Choose r such that W°(ip ro ) < e. 
Then 

^ ( x ) = at ( E + E J i - (*) i ^ 

< 4e iw^_ m ^ m )(x)i + i^°(^)wi < 



iV AT 

m=ro 
AT 

.V A' 



< ^ + ^ E (Wn-MM + W°(A )(X)) < 



m=ro 
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m=l 



N vr,0/v ' N 

k=0 

Here the first term tends to zero as iV — > oo, the second one is less than 2e, 
and the last one also tends to zero (since x ^ -Ey )- Hence for sufficiently 
large iV one has Sn(x) < 3e. □ 

Therefore Proposition 3 is completely proven. This completes the proofs 
of Lemma 1, Theorem 2, and Theorem 1. □ 



References 

[1] V. I. Arnold and A. L. Krylov, Equidistribution of points on a sphere 
and ergodic properties of solutions of ordinary differential equations in 
a complex domain, Dokl Akad. Nauk SSSR 148 (1963), 9-12. 

[2] L. Bowen, Invariant measures on the space of horofunctions of a word 
hyperbolic group, Ergodic Theory Dynam. Systems 30 (2010), no. 1, 
97-129. 

[3] L. Bowen, A. Nevo, Geometric covering arguments and ergodic theorems 
for free groups, arXiv:0912.4953v2 [math.DS]. 

[4] A.I. Bufetov, Ergodic theorems for actions of several mappings, (Rus- 
sian) Uspekhi Mat. Nauk, 54 (1999), no. 4 (328), 159-160, translation 
in Russian Math. Surveys, 54 (1999), no. 4, 835-836. 

[5] A.I. Bufetov, Operator ergodic theorems for actions of free semigroups 
and groups, Funct. Anal. Appl. 34 (2000), 239-251. 

[6] A.I. Bufetov, Markov averaging and ergodic theorems for several op- 
erators, in Topology, Ergodic Theory, and Algebraic Geometry, AMS 
Transl. 202 (2001), 39-50. 

[7] A.I. Bufetov, Convergence of spherical averages for actions of free 
groups. Ann. Math., 155 (2002), 929-944. 

[8] A.I. Bufetov, C. Series, A pointwise ergodic theorem for Fuchsian groups, 
arXiv:1010.3362vl [math.DS]. 



30 



J. Cannon, The combinatorial structure of cocompact discrete hyper- 
bolic groups. Geom. Dedicata, 16 (1984), no. 2, 123-148. 

K. Fujiwara and A. Nevo, Maximal and pointwise ergodic theorems for 
word-hyperbolic groups, Ergodic Theory Dynam. Systems 18 (1998), 
843-858. 

Sur les groupes hyperboliques d'apres Mikhael Gromov. Papers from 
the Swiss Seminar on Hyperbolic Groups held in Bern, 1988. Edited by 
E. Ghys and P. de la Harpe. Progress in Mathematics, 83. Birkhauser 
Boston, Inc., Boston, MA, 1990. 

R. I. Grigorchuk, Pointwise ergodic theorems for actions of free groups, 
Proc. Tambov Workshop in the Theory of Functions, 1986. 

R. I. Grigorchuk, Ergodic theorems for actions of free semigroups and 
groups, Math. Notes, 65 (1999), 654-657. 

R. I. Grigorchuk. An ergodic theorem for actions of a free semigroup. 
(Russian) Tr. Mat. Inst. Steklova 231 (2000), Din. Sist., Avtom. i 
Beskon. Gruppy, 119-133; translation in Proc. Steklov Inst. Math. 2000, 
no. 4 (231), 113-127. 

M. Gromov, Hyperbolic groups, in Essays in Group Theory, MSRI Publ. 
8 (1987), 75-263, Springer- Verlag, New York. 

Y. Guivarc'h, Generalisation d'un theoreme de von Neumann, C. R. 
Acad. Sci. Paris Ser. A-B 268 (1969), 1020-1023. 

A. Nevo, Harmonic analysis and pointwise ergodic theorems for noncom- 
muting transformations, J. Amer. Math. Soc. 7 (1994), no. 4, 875-902. 

A. Nevo, Pointwise ergodic theorems for actions of groups, in Handbook 
of dynamical systems, Vol. IB, 871-982, Elsevier B. V., Amsterdam, 
2006. 

A. Nevo and E. M. Stein, A generalization of Birkhoff's pointwise ergodic 
theorem, Acta Math. 173 (1994), 135-154. 

D. Ornstein, On the pointwise behavior of iterates of a self-adjoint op- 
erator, J. Math. Mech. 18 (1968/1969) 473-477. 

V. I. Oseledets, Markov chains, skew-products, and ergodic theorems for 
general dynamical systems, Th. Prob. App. 10 (1965), 551-557. 



31 



[22] G.-C. Rota, An "Alternierende Verfahren" for general positive operators, 
Bull. A. M. S. 68 (1962), 95-102. 

[23] C. Series, Geometrical Markov coding of geodesies on surfaces of con- 
stant negative curvature, Ergodic Theory Dynam. Systems, 6 (1986), 
601-625. 

[24] E. M. Stein, On the maximal ergodic theorem, Proc. Nat. Acad. Sci. 
USA 47 (1961), 1894-1897. 



32 



